82 results found.
Speech/Written
Corpus,
Language Type:
Multilingual
Languages:
Basque English Finnish French Hungarian Romanian
Availability:
Freely Available
License:
MIT License
Size:
8130 sentences
Production Status:
Newly created-finished
Use:
Machine Translation, SpeechToSpeech Translation
- Paper title: MaSS: A Large and Clean Multilingual Corpus of Sentence-aligned Spoken Utterances Extracted from the Bible
- Paper track: Speech/oral presentation
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Marcely Zanon Boito | MaSS dataset | /N |
Documentation:
Documentation in English at the GitHub page
Language Type:
Multilingual
Languages:
Finnish North Sami
Availability:
From Owner
License:
<Not Specified>
Size:
12 hours
Production Status:
Newly created-in progress
Use:
Speech Recognition/Understanding
- Paper title: Open-domain Interaction and Online Content in the Sami Language
- Paper track: Speech
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country | Affiliation 2 | Country 2 |
|---|---|---|---|---|---|
| Author 1 | Kristiina Jokinen | University of Helsinki | JP | AIRC, AIST | JP |
| Main Contact | Kristiina Jokinen | AIRC, AIST | None | | |
Documentation:
<Not Specified>
Written
Corpus,
Language Type:
Multilingual
Languages:
English Finnish French German Russian
Availability:
Freely Available
License:
CC-BY
Size:
65 languages, 2.60 billion sentence fragments Other
Production Status:
Existing-used
Use:
Textual Entailment and Paraphrasing
- Paper title: Open Subtitles Paraphrase Corpus for Six Languages
- Paper track: Written
- Paper status: Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Mathias Creutz | University of Helsinki | FI |
| Main Contact | Mathias Creutz | University of Helsinki | None |
Documentation:
See web page
Written
Lexicon,
Language Type:
Multilingual
Languages:
Danish English Finnish Norwegian Swedish
Availability:
Freely Available
License:
<Not Specified>
Size:
<Not Specified> Other
Production Status:
Newly created-finished
Use:
Lexicon Creation/Annotation
- Paper title: The Gavagai Living Lexicon
- Paper track: Terminology
- Paper status: Accept Poster+Demo Suggested
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Magnus Sahlgren | Gavagai | SE |
| Author 2 | Amaru Cuba Gyllensten | <Not Specified> | None |
| Author 3 | Fredrik Espinoza | Gavagai | SE |
| Author 4 | Ola Hamfors | Gavagai | SE |
| Author 5 | Jussi Karlgren | Gavagai | SE |
| Author 6 | Fredrik Olsson | Gavagai | SE |
| Author 7 | Per Persson | Gavagai | SE |
| Author 8 | Akshay Viswanathan | Gavagai | SE |
| Author 9 | Anders Holst | SICS | SE |
| Main Contact | Magnus Sahlgren | Gavagai | None |
Documentation:
<Not Specified>
Written
Corpus,
Language Type:
Multilingual
Languages:
Danish Dutch Finnish Mandarin Chinese Standard Arabic
Availability:
Freely Available
License:
<Not Specified>
Size:
4.2 MByte
Production Status:
Newly created-finished
Use:
Person Identification
- Paper title: Creating and Curating a Cross-Language Person-Entity Linking Collection
- Paper track: Evaluation
- Paper status: Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Dawn Lawrie | Loyola College in Maryland | None |
| Author 2 | James Mayfield | Johns Hopkins University | None |
| Author 3 | Paul McNamee | Johns Hopkins University | None |
| Author 4 | Douglas Oard | University of Maryland | None |
| Main Contact | Dawn Lawrie | Loyola University Maryland | US |
Documentation:
Documentation in English with Download
Written
Lexicon,
Language Type:
Multilingual
Languages:
English Finnish French Japanese Portuguese
Availability:
Freely Available
License:
GNU LGPL 2.1
Size:
More than 2 million entries
Production Status:
Existing-updated
Use:
General Lexical Resource
- Paper title: Attaching Translations to Proper Lexical Senses in DBnary
- Paper track: long paper
- Paper status: Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Andon Tchechmedjiev | <Not Specified> | FR |
| Author 2 | Gilles Sérasset | Univ Grenoble Alpes | None |
| Author 3 | Jérôme Goulian | Univ Grenoble Alpes | None |
| Author 4 | Didier Schwab | Univ Grenoble Alpes | None |
| Main Contact | Andon Tchechmedjiev | IMT Mines Alès | None |
Documentation:
http://dbnary.forge.imag.fr/ Documentation in English
Written
Corpus,
Language Type:
Multilingual
Languages:
English Finnish
Availability:
Freely Available
License:
<Not Specified>
Size:
77048083 words
Production Status:
Newly created-finished
Use:
Machine Translation, SpeechToSpeech Translation
- Paper title: Producing Monolingual and Parallel Web Corpora at the Same Time - SpiderLing and Bitextor's Love Affair
- Paper track: Written
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country | Affiliation 2 | Country 2 |
|---|---|---|---|---|---|
| Author 1 | Nikola Ljubešić | University of Zagreb | HR | | |
| Author 2 | Miquel Esplà-Gomis | Universitat d'Alacant | ES | | |
| Author 3 | Antonio Toral | Dublin City University | IE | | |
| Author 4 | Sergio Ortiz Rojas | <Not Specified> | None | | |
| Author 5 | Filip Klubička | University of Zagreb | HR | | |
| Main Contact | Nikola Ljubešić | Jožef Stefan Institute | None | University of Zagreb | None |
Documentation:
<Not Specified>
Sign Language
Evaluation Data,
Language Type:
Monolingual
Languages:
Finnish
Availability:
From Data Center(s)
License:
To be decided
Size:
40 GByte
Production Status:
Newly created-in progress
Use:
Sign Language Recognition/Generation
- Paper title: S-pot - a benchmark in spotting signs within continuous signing
- Paper track: Evaluation
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country | Affiliation 2 | Country 2 | Affiliation 3 | Country 3 |
|---|---|---|---|---|---|---|---|
| Author 1 | Ville Viitaniemi | Aalto University School of Science | FI | | | | |
| Author 2 | Tommi Jantunen | <Not Specified> | None | University of Jyväskylä | FI | University of Jyväskylä | None |
| Author 3 | Leena Savolainen | Finnish Association of the Deaf | FI | | | | |
| Author 4 | Matti Karppa | Aalto University | FI | | | | |
| Author 5 | Jorma Laaksonen | Aalto University School of Science | FI | | | | |
| Main Contact | Ville Viitaniemi | Aalto University School of Science | None | | | | |
Documentation:
To be created and included in the data distribution
Written
Lexicon,
Language Type:
Multilingual
Languages:
Finnish
Availability:
Freely Available
License:
CC BY-SA 3.0
Size:
340787 entries
Production Status:
Existing-used
Use:
Lexicon Creation/Annotation
- Paper title: Evaluation of Dictionary Creating Methods for Finno-Ugric Minority Languages
- Paper track: Evaluation
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Zsanett Ferenczi | Research Institute for Linguistics, Hungarian Academy of Sciences | HU |
| Author 2 | Iván Mittelholcz | Research Institute for Linguistics, Hungarian Academy of Sciences | HU |
| Author 3 | Eszter Simon | Research Institute for Linguistics, Hungarian Academy of Sciences | HU |
| Author 4 | Tamás Váradi | Research Institute for Linguistics, Hungarian Academy of Sciences | HU |
| Main Contact | Eszter Simon | Research Institute for Linguistics, Hungarian Academy of Sciences | None |
Documentation:
<Not Specified>
Written
Lexicon,
Language Type:
Multilingual
Languages:
English Finnish
Availability:
Freely Available
License:
CreativeCommons (CC-BY 3.0)
Size:
117659 <Not Specified>
Production Status:
Existing-updated
Use:
Word Sense Disambiguation
- Paper title: Representing the Translation Relation in a Bilingual Wordnet
- Paper track: Written
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country | Affiliation 2 | Country 2 |
|---|---|---|---|---|---|
| Author 1 | Jyrki Niemi | University of Helsinki | None | | |
| Author 2 | Krister Lindén | University of Helsinki | None | University of Helsinki | FI |
| Main Contact | Jyrki Niemi | University of Helsinki | FI | | |
Documentation:
Readme files in English and Finnish are publicly available




